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INTRODUCTION 

Field of the Invention: 

This invention pertains to the field of nucleic acid chemistry, and to combinatorial 
chemistry as it is applied to nucleic acids. More specifically, the invention concerns 
procedures for obtaining oligonucleotides and their analogs that can serve a function, most 
preferably as ligands, receptors, and catalysts. More specifically, this invention relates to 
methods for selecting in vitro oligonucleotide derivatives that can serve as ligands, 
receptors, or catalysts, starting from a pool of oligodeoxyribonucleotide derivatives that 
incorporate non-standard nucleobases, or incorporate standard nucleobases modified to 
carry additional functional groups, or standard oligonucleotides acting in conjunction with 
organic cofactors. 

Background of the Invention 

The majority of pharmaceutical agents are compounds that exert their biological activity 
by binding to a biological macromolecule, referred to here as a receptor. The discovery of 
ligands that can bind to a preselected receptor and thereby exert a biological effect is an 
important goal of medicinal chemists seeking to develop new human pharmaceutical agents. 

Classically, ligands are discovered by three strategies. The first involves screening of a 
collection of chemicals whose structures have no deliberate connection with the structure or 
biology of the target receptor. This process is referred to as "random screening". 

The second strategy requires information about the structure of the natural ligand for a 
receptor. Development of new ligands then is based on the deliberate synthesis of specific 
analogs of the natural ligand in the hope of discovering a ligand that retains or has increased 
affinity for the receptor, together with bioavailability, stability, and other properties desired 
for a human pharmaceutical. 

The third strategy requires information about the structure of the receptor itself. With this 
information, ligands are designed by the design of structures that are complementary to the 
binding site of the ligand. 
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The deficiencies of these three approaches are well known to those familiar with the art. 
Random input screening often requires examination of thousands of compounds before a 
single ligand has a chance of being identified. Analogs of the natural ligands often resemble 
the natural ligand in terms of bioavailability , stability, or other properties; often, these 
properties are undesirable in a human pharmaceutical. Further, while tremendous strides 
have been made in the science of molecular recognition over the past decade, it is still not 
possible to design a ligand for a receptor even given a high resolution experimental structure 
for the receptor itself. 

One approach suggested for solving these problems comes under the tide of "in vitro 
selection". The approach has several implementations. Most commonly, a collection, or 
library, of oligonucleotides of random sequence is presented to a receptor, often attached to 
a solid support. The receptor binds to only a few oligonucleotides in the library. The 
oligonucleotides in the pool that do not bind to the receptor are then washed from the 
receptor. The oligonucleotides that bind to the receptor tightly are then eluted from the 
receptor and recovered. These are then amplified by polymerase chain reaction technology, 
well known in the art (Mullis et al., U.S. Patent 4,683,202), to yield a library of 
oligonucleotides whose members have a higher affinity, on average, than the members of 
the original pool. This new library is then subjected to mutation by methods well known in 
the art to create a new library of oligonucleotides with structures randomized around those of 
the starting library with increased affinity for the receptor. These are then subjected to the 
binding, elution, and amplification steps in repeated cycles, leading to oligonucleotides with 
increased binding activity. After several rounds of selection, a secondary library can next be 
prepared from a mixture of oligonucleotides already enriched with those that have affinity 
for a receptor, by amplifying the mixture using polymerases under conditions where the 
polymerase makes mistakes. The secondary library re-diversifies the library of ligands that 
already has some affinity to the receptor, permitting the in vitro selection experiment to 
search a region of combinatorial "sequence space" (Benner, S. A., Ellington, A. D. (1988) 
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CRC Crit. Rev. Biochem. 23, 369-426) that is likely to contain oligonucleotides with the 
desired properties. 

Those of ordinary skill in the art understand both the value of in vitro selection, and how 
to perform its individual steps in the laboratory. Preparation of mixtures of oligonucleotides, 
affinity purification of ligands from a mixture containing both ligands and non-ligands, 
amplification of oligonucleotides by PCR, and the mutagenesis of oligonucleotides are all 
well known in the art. In vitro selection has been recognized as being useful for obtaining 
ligands, receptors, and catalysts by many groups. Early articles by Joyce (Joyce, G. F. 
(1989) in RNA: Catalysis, Splicing, Evolution, Belfort and Shub, eds. Elsevier, 
Amsterdam, pp. 83-87), Irvine et al. (Irvine, D., Tuerk, C, Gold, L. (1991) 7. Mol Biol 
222, 739 -761), and Szostak (Szostak, J. W. (1992) Trends Biochem. Soc. 17, 89-93) 
layed out the elements of classical in vitro selection methodologies. For example, in vitro 
selection was used to obtain oligonucleotides as ligands for reverse transcriptase by Chen et 
al (Chen, H„ Gold, L. (1994) Biochemistry 33, 8746-8756). In the patent literature, Gold 
and Tuerk (U.S. Patent 5,270,163) propose in vitro selection methods on RNA libraries as 
an approach towards obtaining ligands and receptors. Gold (U.S. patent 5,476,766) 
propose an in vitro selection system for amplifying RNA to create ligands for thrombin. 
Jayasena and Gold (U.S. Patent 5,472,841) consider oligonucleotide libraries where the 2- 
hydroxyl group of the ribose ring of standard oligonucleotides may be replaced by a 2- 
amino group. In principle, when selection methods permit enrichment of oligonucleotides 
that bind to a transition state analog, catalysts might also be obtained by in vitro selection 
(U.S. Patent 5,270,163). 

It is clear that in vitro selection based on natural oligonucleotides has limited efficacy, 
however. As discussed in Serial No. 07/594,290, filed: 10/09/90, now issued as U.S. 
Patent 5,432,272, of which the instant application is a continuation-in-part, the primary 
difficulty with in vitro selection as a tool for creating molecules with desired properties 
arises from the fact that natural oligonucleotides do not themselves display a wide diversity 
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in either structure, or conformation, or functionality. Oligonucleotides that form the libraries 
used in classical in vitro selection experiments are built from only four building blocks. 
These building blocks carry relatively little useful functionality, especially compared to that 
carried by proteins. Thus, oligonucleotides with tight affinity for some receptors occur only 
infrequently in a population of random oligonucleotides. This makes it difficult to identify 
oligonucleotides within a library of standard oligonucleotide sequences that have satisfactory 
binding or catalytic properties for a range of ligands and reactions. 

SUMMARY OF THE INVENTION 
The instant invention provides improvements to classical in vitro selection methods 
already known in the art, to increase the effectiveness of in vitro selection experiments as a 
tool for generating new structures (e.g., bent oligonucleotides), ligands, receptors, and 
catalysts. The improvements comprise incorporating into the oligonucleotide analogs that are 
the components of a library to which in vitro selection methods functionalized standard 
nucleotides and non-standard nucleotides, both functionalized and not functionalized, and 
incorporating functionalized cofactors into the in vitro selection experiment. 

BRIEF DESCRIPTION OF THE DRAWINGS xxx 
Fig.l shows the two standard base pairs formed between the standard bases, e.g. C and 
G; T and A. To systematize the nomenclature for standard nucleobases, pyrimidines are 
designated by the prefix "py", purines by the prefix "pu n . Following the prefix is the order, 
from the major groove to the minor groove, of acceptor (A) and donor (D) groups. Thus, 
both thymine and uracil implement the standard hydrogen bonding pattern py ADA. The 
standard nucleobase cytosine implements the standard hydrogen bonding pattern pyDAA. 
The standard nucleobases adenine and guanine implement the standard hydrogen bonding 
pattern puDA- and puADD respectively. 
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Fig. 2 shows the specific structures of eight different base pairs that include the preferred 
embodiments according to the subject invention. To systematize the nomenclature for non- 
standard nucleobases, pyrimidines are designated by the prefix "py", purines by the prefix 
"pu". Following the prefix is the order, from the major groove to the minor groove, of 
acceptor (A) and donor (D) groups. 

Fig. 3 shows the general structures of eight different base pairs according to the subject 
invention. The notation -R designates the point of attachment of the ribose, deoxyribose, or 
ribose or deoxyribose derivative, X is either a nitrogen atom or a carbon atom bearing a 
substituent Z, Z is either a hydrogen, an unfunctionalized lower alkyl, alkynyl, or alkyl- 
alkynyl chain, or a lower alkyl, alkynyl , or alkyl-alkynyl chain bearing an amino, 
trifluoroacetamido, carboxyl, hydroxy, thiol, aryl, indole, or imidazoyl group, Y is either N 
or CH, and the ring contains no more than three nitrogens consecutively bonded. 

DETAILED DESCRIPTION OF THE INVENTION AND PREFERRED EMBODIMENTS 

Three approaches might be envisioned to circumvent limitations of in vitro selection 
systems that are intrinsic in the limitations in the structure of conventional oligonucleotides 
built fully from conventional nucleotides (ribo or deoxyribo adenylic acid, guanylic acids, 
and cytidylic acid, and uridylic acid or thymidylic acid, referred to by convention as A, G, 
C, T and U). The first is based on an understanding of how complementary oligonucleotides 
recognize each other, and how complementarity is used in polymerase chain reaction 
amplification of oligonucleotides during an in vzfrcLselection cycle. The invention disclosed 
in Serial number 08/375,132 (filed 01/17/95, for "In vitro Selection with Non-standard 
Bases") discloses an invention where the number of nucleobase building blocks that can be 
incorporated and independently replicated in an oligonucleotide analog is increased, where 
the additional nucleobases present non-standard patterns of hydrogen bonding groups to a 
complementary strand, and therefore can be independently copied in a polymerase chain 
reaction (PCR). In this context, "non-standard" refers to the pattern of hydrogen bonds that 
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holds a nucleobase to its complement in a complementary strand in a Watson-Crick 
geometry. The non-standard nucleobases confer additional function on an oligonucleotide 
analog not found in a conventional oligonucleotide. 

The second involves the addition of "cofactors" to the library. Cofactors are defined here 
to mean low molecular weight substances that carry one or more functional groups, and that 
bind either covalently (for example, through an inline linkage or a disulfide bond) or non- 
covalently to an oligonucleotide and deliver the valuable functionality to a non-covalent 
complex between the cofactor and the oligonucleotide. Cofactors are distinct in that they are 
not encoded in the oligonucleotide,and therefore the information concerning their structure is 
not amplified during a PCR cycle. Cofactors are well known in protein chemistry. For 
example, natural protein polymers do not contain amino acids that have aldehyde or ketone 
functionality, a redox active center, or a unit that can perform Umpolung chemistry. This 
means that proteins have an mtrinsically low ability to catalyze reactions that require these 
functionalities. Natural proteins circumvent this limitation by using the cofactors pyridoxal 
phosphate, flavin adenine dinucleotide, and thiamine, respectively, small molecules that bind 
non-covalently to a protein built with standard amino acids, and once bound, provide the 
functionality missing in the building blocks of a standard protein. The binding and catalytic 
potential of oligonucleotides can also be improved by the use of cofactors, and cofactors can 
be designed to facilitate their non-covalent interactions with oligonucleotides. 

Last, additional functionality can be introduced into an oligonucleotide by covalently 
attaching this functionality to standard nucleotides. This requires that a modified 
oligonucleotide ("modified" because it differs from a natural oligonucleotide by carrying 
appended moieties not found in natural oligonucleotides) be built to include nucleoside units 
that are modified, or derivatized, to carry functionality on the nucleoside. Because in vitro 
selection experiments require amplification of modified oligonucleotides, and because this 
amplification is normally done using a poiyperase obtained from Nature, the most useful 
modifications are placed at positions in the oligonucleotide where modification is known to 
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be tolerated by polymerases. On the nucleobase, these modifications either the sugar or the 
nucleobase. On the sugar, modification is preferred at the 2-position, preferably 2'-deoxy- 
2-amino nucleosides, or where functional groups are appended to the 2-oxygen of a 
ribonucleoside in the biopolymer or in a ribonucleoside triphosphate. On the nucleobase, the 
modification are preferably appendages on the 5-position of pyrimidines, the 7-position of a 
7-deazapurine, the 6-position exocyclic substitution (normally an amino group), or less 
preferably at the 2-position exocyclic substitution (normally an amino group). In non- 
standard nucleobases where the heterocycle numbering system is different, these 
modifications are placed at the structurally analogous positions. Serial No. 07/594,290 filed 
October 9, 1990, now issued as U.S. Patent No. 5,432,272, and its continuations-in-part of 
which this application is a continuation-in-part, as well as Serial No. 08/094,363, disclosed 
and claimed a variety of these structures, including nucleoside analogs earring amino, 
carboxyl, hydroxyl, thiol, aryl, indole, and imidazole groups. Through manipulation of the 
side chain functionality using chemistry well known in the art, a large set of functionalized 
oligonucleotides can be prepared suitable as a starting point for in vitro selection. 

These approaches for increasing the diversity of functions that might be obtained by in 
vitro selection are not mutually exclusive. That is, they can be used at the same time. Indeed, 
the greatest diversity of functionality can be obtained from an in vitro selection experiment 
involving libraries of oligonucleotide analogs can be gained if all three strategies are used 
simultaneously, that is, if the oligonucleotide analogs contain non-standard nucleobases, 
modified standard nucleobases, and if the in vifro selection experiment is done in the 
presence of cofactors. Indeed, as discussed below, simply adding functionalized standard 
nucleosides, modified either on the nucleobase or on the sugar, in the absence of non- 
standard nucleobases as part of an expanded genetic alphabet, can lead to 
overfunctionalization. 

Serial No. 07/594,290, now issued as U.S. Patent 5,432,272, outlined the structural 
features of the Watson-Crick base pair that serves as the basis for the amplification of 
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nucleic acids by enzymes such as DNA and RNA polymerase and in the polymerase chain 
reaction (PCR). The disclosure also noted that nucleobases to which are appended moieties 
containing functional groups can provide oligonucleotide analogs that have properties that 
are not readily accessible, or perhaps not accessible at all, in natural oligonucleotides. For 
example, the disclosure pointed out that a single oligonucleotide sequences containing a T to 
which is appended as side chain bearing a biotin residue can first bind to a complementary 
oligonucleotide and the hybrid can then be isolated by virtue of the specific affinity of biotin 
to avidin (Langer, P. R., Waldrop, A. A., Ward, D. C. (1981) Proc. Nat Acad. ScL 78, 
6633-6637). The latter is not a property readily available to a natural oligonucleotide. 
Further, it was stated that such modified natural nucleosides, as triphosphates, could serve 
as substrates for DNA and RNA polymerases, and be incorporated enzymatically into an 
oligonucleotide. 

Not found in the prior art is a disclosure that functionalization of natural nucleobases can 
increase the likelihood that an oligonucleotide with a selectable property can be found in a 
pool of oligonucleotide derivatives of random sequence. Serial No. 07/594,290, issued as 
U.S. Patent 5,432,272 pointed this out in specific cases, and Serial No. 08/375,132 
(pending) amplified this point for the general case. Serial No. 08/375,132 pointed out that 
libraries of oligonucleotide analogs built from a combination of natural nucleotides, natural 
nucleotides bearing functionalized side chains, or nucleotide building units bearing 
nucleobases with one of the non-standard hydrogen bonding patterns, would be useful in in 
vitro selection experiments. Serial No. 08/094,363. disclosed specific modified standard 
nucleobases that would be useful for this purpose, in particular, to expand the catalytic 
power of oligonucleotides. 

This invention is preferably characterized as an improvement on standard methods for 
performing in vitro selection, not obvious from the prior art covering in vitro selection 
performed with only standard, unmodified nucleotides. The improvement constitutes 
incorporating into the procedure for synthesizing a library one or more building units that 
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cany functionalized standard nucleobases, or non-standard nucleobases, or both, to create 
libraries that contain (in the first case) "modified oligonucleotides" (to indicate that they 
differ from natural oligonucleotides in their appended moieties, but not in the pattern of 
hydrogen bonding), or (in the second case) "non-standard oligonucleotides" (to indicate that 
they differ from natural oligonucleotides in their pattern of hydrogen bonding, at least in 
part). Collectively, we refer to modified oligonucleotides and non-standard oligonucleotides 
as oligonucleotide analogs, distinct from conventional (or natural, or unmodified) 
oligonucleotides. Alternatively, the in vitro selection can start with oligonucleotide libraries 
composed entirely of conventional oligonucleotides, but where cofactors designed to 
contribute functionality to the oligonucleotides in the library via covalent binding. 

While virtually any functionality can be incorporated into a nucleotide building block, 
standard or non-standard, both experiment and theory show that certain modifications are 
more useful than others. The most useful modifications carry functionality not found in 
natural oligonucleotides, including general acids and general bases with pKa values near 7, 
positively charged side chains, and aliphatic hydrophobic moieties. Therefore, in the 
preferred embodiment of the instant invention, certain constraints are desirable. 

First, the nucleotide building block analogs that are incorporated into the oligonucleotide 
analogs that are the components of the library, once incorporated in the oligonucleotide, 
should not prevent said oligonucleotide from serving as a template for a polymerase. 
Further, in the case of non-standard nucleotide building blocks, the non-standard nucleobase 
and its complement must both be able to serve in both the template and as a triphosphate in a 
polymerase-based amplification system. Last, for both standard and non-standard 
oligonucleotides, it is convenient (but not necessary) for subsequent rounds of selection if 
the nucleoside analog triphosphate can also serve as a substrate in the final step of the 
amplification, the step that generates the new pool of oligonucleotides for in vitro selection. 

Serial No. 07/594,290, now issued as U.S. Patent 5,432,272, discloses methods for 
achieving such amplifications with non-standard nucleobases, where the most preferred 
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non-standard pyrimidine analog is 3-p-D-ribofuranosyl-(2,6-diaminopyrimidine), or its 2- 
deoxyribose analog. The presently preferred complementary purine is xanthosine, or its 2'- 
deoxyribose analog. 

Since Serial No. 07/594,290 was filed, continued experimental work has increased the 
number of embodiments of the method claimed, and the compositions of matter that can be 
prepared using the method. These are disclosed in Serial No. 08/375,132. An even wider 
range of polymerases has been shown to incorporate non-standard base pairs into duplex 
oligonucleotides, including the reverse transcriptase from human immunodeficiency virus 1, 
especially this polymerase with the smaller subunit removed, and the thermostable 
polymerases from Thermotoga maritima, Pyrodictium abyssU and Pyrodictium occultum. 
The py(ADD) base in its pyrazine form has been incorporated into an oligonucleotide using 
T4 RNA ligase (J. J. Vogel, S. A. Benner (1994) J. Am. Chem. Soc. 116, 6929-6930), 
which is incorporated herein by reference. Further, since Serial No. 07/594,290 was filed, 
continued experimental work has also increased the number of routes available for the 
preparation of non-standard bases as components of oligonucleotides. (J. J. Vogel, M. M. 
Altorfer, S. A. Benner. Helv. Chim Acta 76, 2061-2069 (1993); Vogel, J. J., Benner, S. 
A. Helv.Chim.Acta 79, 1881-1898 (1996); Vogel, J. L, Benner, S. A. Helv. Chim.Acata 
79, 1863-1880 (1996)). Additional information has been gathered concerning the properties 
of the non-standard heterocyclic systems. J. J. Vogel, U. von Krosigk and S. A. Benner. /. 
Org. Chem. 58, 7542-7547 (1993), and further heterocyclic ring systems have bee defined 
that implement the non-standard hydrogen bonding patterns disclosed in Serial No. 
07/594,290. Further, functional groups Z have been found to be preferred when the 
attachment to carbon is via an acetylenic linkage, as disclosed in Hobbs, Jr., F. W. (1989) 
(/. Org. Chem. 54, 3420-3422). The py(DDA) and py(ADD) implementations have been 
found to be most preferably implemented on carbocyclic analogs of ribose and deoxyribose 
rings. 
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When the improvement involves incorporation of a modified standard nucleotide, the 
requirements for interaction with polymerases are less stringent. During the amplification 
cycles, the modification need not be present in the standard nucleobase, permitting the 
amplification to proceed as in a classical in vitro selection experiment. However, in the first 
copying in the amplification, the modified nucleotide must be able to serve as a template. 
Further, while not necessarry to identify the amplified sequences, it is desirable for 
subbsequent rounds of selection if the modified nucleotide can be incorporated into the new 
library in the final polymerase cycle, and that the triphosphate of the modified nucleoside be 
acceptable at least to some extent. As noted in Serial No. 07/594,290, now issued as U.S. 
Patent 5,432,272, pyrimidines carrying appendages at the 5-position are accepted by many 
polymerases. Experiments in the inventor's laboratory now have shown that these 
pyrimidines also serve as templates for polymerization catalyzed by the Klenow fragment of 
DNA polymerase I from E. coll Thus, the presently preferred embodiment of the instant 
invention that introduces functionality on standard nucleobases, does so by modifications of . 
2-deoxyuridine and 2-deoxycytidine at the 5-position of the pyrimidine ring, uses the 
Klenow fragment or one of its homologs as the polymerase for the initial and final rounds of 
amplification, and uses non-functional nucleoside triphosphates in intermediate rounds of 
amplification. Likewise, adenosine triphosphate derivatives earring appendages on the 6- 
position exocyclic amino group are also substrates for certain polymerases, such as T7 RNA 
polymerase (Folsom, V., Hunkeler, M. J., Haces, A., Harding, J. D. (1989)Ana/. 
Biochem. 182, 309-314). 

The improvement involving adding a functionalized cofactor to the library is the least 
demanding on the polymerase. The presently preferred cofactors are short 
oligoribonucleotides of defined sequence, preferably 4-6 nucleobases in length, whose 
2\3'-diol end has been cleaved by periodate following a procedure well known in the art, 
and the resulting dialdehyde reacted with a functionalized amine in the presence of sodium 
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cyanoborohydride to give a morphilo derivative carrying a fucntionality at the 3'-end. The 
presently most preferred amine is histamine. 

As is well known in the art, in vitro selection may be used to generate catalysts as well z 
ligands. Methods for the selection of oligonucleotides that have a preselected catalytic 
activity from a library of oligonucleotides having a region of randomized sequence are well 
developed in the published literature (see Ekland, E. H., Bartel, D. P. (1996) Nature 
382,373-376, and references therein). Briefly, selection for catalytic oligonucleotides 
requires that the reaction catalyzed transform the structure of the catalytic oligonucleotides ii 
a way that either allows them to be partitioned from the non-catalytic oligonucleotides, or 
allows them to be amplified in preference to the non-catalytic oligonucleotides. To the first 
end, the catalytic reaction may affix a biotin covalently to the catalytic oligonucleotides, 
allowing them to be partitioned from the non-catalytic oligonucleotides using an aviding 
affinity column. To the second end, the catalytic transformation may join a primer binding 
site to one end of the catalytically active oligonucleotides, allowing them to be preferentially 
amplified in a PCR. The improvement of the instant invention comprises including non- 
standard nucleobases into the oligonucleotides in the library, or including functionalized 
nucleobases into the library, or adding functionalized cofactors into the in vitro selection 
mixture. Selection for catalysts is especially well suited for the cofactor strategy, as the 
cofactor can be added to the selected oligonucleotide in a situation where the catalystic 
activity has practical application. 

The introduction of nculeotide analogs into an oligonucleotide nearly always expands the 
conformational versatility and range of reactivity of the oligonucleotide analog when 
compared with the unmodified or fully standard oligonucleotide. This implies that the 
oligonucleotide analog essentially always has a higher probability of having a preselected 
behavior. As a corollary, this implies that a library comprised of oligonucleotide analogs will 
contain more individual oligonucleotides with the preselected behavior than the 
corresponding library built entirely from natural oligonucleotides. 
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However, overfunctionalization can generate oligonucleotides that no longer behave well, 
as discussed in Serial No. 07/594,290, now issued as U.S. Patent 5,432,272. For 
example, introducing hydrophobic moieties into a large fraction of the nucleobases in an 
oligonucleotide can decrease the solubility of the oligonucleotide, causing aggregation and 
precipitation. Further, introducing multiple functionality into an oligonucleotide can create 
undesired catalytic powers, most significantly ribonucleolytic activity that destroys the 
library. For these reasons, the preferred embodiment has less than 15% of the nucleotide 
units containing functionality. Serial No. 07/594,290 disclosed that non-standard 
nucleobases permit the best control over the level of fiinctionalization present in an 
oligonucleotide. When using functionalized standard nucleobases, the preferred embodiment 
introduces a single nucleobase (C, for example) only in modified form. The fraction of C in 
the oligonucleotide library is less than 15%, while the fraction of the other nucleotides is 
correspondingly higher, in the presently preferred embodiment. 

Further, the presently preferred embodiment constructs libraries based on oligo-2- 
deoxyribonucleotide analogs, as these are more stable to hydrolytic cleavage than 
oligoribonucleotide analogs. 

Preferably the oligonucleotide analogs that form the components of a library of 
oligonucleotide analogs are from 20 to 300 nucleotides in length. These are most preferably 
prepared using terminal transferase to catalyze the elongation of a "primer" of defined 
sequence in the presence of nucleoside triphosphates and nucleoside triphosphate analogs. 
The ratio of triphosphate molecules to primer molecules determines the average length of the 
oligonucleotides in the library. This average length is assessed by agarose gel 
electrophoresis with ethidium bromide used as a visualizing agent. The ratio of building 
blocks in the library is determined by the ratio of triphosphates in the incubation mixture 
with terminal transferase. To assess the overall composition of the pool, a sample of the 
mixture is treated with deoxyribonuclease, the nucleotide products dephosphorylated using 
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alkaline phosphatase, and the nucleoside products resolved and quantitated by high 
performance liquid chromatography. 

Cofactors can be used in many different ways in an in vitro selection experiment. In the 
most general sense, a cofactor is anything that contributes to the functioning of the 
oligonucleotide is not encoded in the sequence of the oligonucleotide or the oligonucleotide 
analog. In this very general sense, the divalent cation (e.g. magnesium) present in many in 
vitro selection experiments is a cofactor. It permits oligonucleotides to adopt conformations 
that are important for function as a ligand, receptor, or catalyst, and may also contribute 
reactivity that is essential to the catalytic behavior of the oligonucleotide or oligonucleotide 
analog. 

Cofactors that are the objects of the instant invention can be more narrowly defined. 
First, the instant invention includes only those cofactors that present organic functionality to 
the library, that is, those whether the functional group is constructed from the elements 
carbon, hydrogen, nitrogen, oxygen, halogen, sulfur and phosphorus. The preferred 
functional groups are amine, guanidinium, hydroxyl, carboxylate, thiol, and heterocycles. 
In this contest, aliphatic hydrophobic appendages are also regarded as "functional" (even 
though they contain no functional groups), because they confer upon an oligonucleotide a 
property (aliphatic hydrophobicity) that is not found in conventional oligonucleotides. Most 
preferably is functionality important for catalysis, specifically imidazole, as disclosed in 
Serial number 08/094,363, functionality that contributes cationic groups, specifically amines 
and guanidiniums, and functionality that permits the cofactor to adopt unusual three 
dimensional conformations, specifically thiols. Several of these functional groups are found 
in the modified nucleobases disclosed herein. They can also be provided by cofactors. 

Next ,we consider cofactors of five classes based on their potential for interacting with an 
oligonucleotide. The first interact primarily through coulombic interactions between a 
positively charged cofactor and the anionic phosphate backbone of an oligonucleotide. The 
polyamines spermine and spermidine are examples of cofactors of this class. The interaction 
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between a polycationic cofactor and an oligonucleotide can be supplemented by secondary 
actions that confer specificity to the binding, that is, that causes this class of cofactor to bind 
preferentially to one segment of an oligonucleotide chain over another. This specificity in 
binding can be selected in the in vitro selection experiment. 

The second class interacts with an oligonucleotide by intercalation. Again, the specificty 
of binding is low. 

The third class of cofactor interacts with an oligonucleotide by interacting with the 
nucleobases in a non-intercalative sense, and also in a non- Watson-Crick sense. Netropsin 
is an example (Zimmer, C. (1975) Prog. Nucleic Acid Res. Mol Biol 15, 285-318). The 
specificity of binding is determined by the sequence of the oligonucleotide, in particular, 
how that sequence places hydrogen bond donating and accepting groups into the major and 
minor grooves. 

A fourth class of cofactor interacts with an oligonucleotide primarily through formation 
of Watson-Crick base pairs, with Watson-Crick specificity being the primary basis for 
selectivity of binding. The presently preferred cofactors of this class are short 
oligonucleotides between 2 and 10 nucleotides in length to which is appended functionality. 
The most preferred cofactors are generated by treating ribonucleotide meeting this 
description with sodium periodate, condensing the resulting 2',3'-seco dialdehyde with a 
primary amine carrying additional functionality, and reducing the adduct to yield a 
morphilino analog of the 3'-terminal ribonucleoside following the procedures of Khym 
(Khym, J. X. (1963) Biochemistry 2, 344), Brown & Read (Brown, D. ML, Reed, A. P. 
(1965) J. Chem. Soc. 1965, 5072) and Stirchak et al. (Stirchak, E. P., Summerton, J. E.., 
Weller, D. D. (1989) Nucleic Acids Res. 17, 6129-6141). The preferred primary amines are 
short polypeptides from two to six amino acids in length, with the residues in the peptide 
selected from the group consisting of the 20 proteinogenic amino acids excluding lysine. 
These can be prepared by automated solid phase synthesis by procedures well known in the 
art. 

16 



SUL-402D1 



The fifth class of cofactor interacts with an oligonucleotide or an oligonucleotide analog 
through the formation of a covalent bond. The presently preferred mode of interaction is a 
covalent bond between the cofactor and the functional group of a functionalized nucleobase. 
For example, a library of oligonucleotide analogs incorporating one or more 
mercaptopropargyl side chains is subjected to an in vitro selection experiment for catalysis in 
an acetate-borate buffer at pH 8.0. To this incubation mixture is added one or more 
functionalized thiols as mixed disulfides with 4-nitrothiosalicylic acid in a 1.5 fold excess. 
The mixture undergoes disulfide exchange rapidly under these conditions, yielding 
oligonucleotide analogs to which are appended functional groups via a disulfide bond. The 
presently preferred cofactors are short polypeptides from two to six amino acids in length, 
each peptide carrying exactly one cysteine residue, with the remainder of the residues in the 
peptide selected from the group consisting of the proteinogenic amino acids excluding 
cysteine. These can be prepared by automated solid phase synthesis by procedures well 
known in the art. 

As another example, a library of oligonucleotide analogs incorporating one or more 
aminopropargyl side chains is subjected to an in vitro selection experiment for catalysis in an 
acetate buffer at pH 6.0. To this incubation mixture is added one or more functionalized 
aldehydes. The aldehyde rapidly forms an imine with the amino group of the side chain, 
yielding oligonucleotide analogs to which are appended functional groups via a C=N bond. 
The presently preferred cofactors are prepared from short peptides from two to six amino 
acids in length, each peptide having an N-tenninal serine, with the remainder of the residues 
in the peptide selected from the group consisting of the proteinogenic amino acids excluding 
lysine. These can be prepared by automated solid phase synthesis by procedures well 
known in the art. When such peptides are treated in aqueous solution with sodium periodate, 
the N-terminal serine is converted to a terminal glyoxylic acid residue. 
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EXPERIMENTAL 



Example 1. Synthesis of derivatives of uridine and 2'-deoxyuridine suitable for 
incorporation into an oligonucleotide library. 
N-propargyltrifluoroacetamide 

This compound has been prepared using a literature method (K. A. Craickshank, D. L. 
Stockwell (1988) Tetrahedron Lett. 29, 5221). Propargylamine (90.9 mmol, 5 g, 6.227 
mL) was added to methanol (90 mL) and the solution cooled to 0°C. Ethyl trifluoroacetate 
(1.3 equiv., 118.2 mmol, 16.8 g, 14.1 mL) was added and the mixture stirred at room 
temperature for 24 h. The solvent was evaporated, the residue diluted with 90 mL 
chloroform and washed with aqueous NaHC03 solution (2 x 75 mL) and water (1 x 75 
mL). The organic layer was evaporated and the orange residue distilled in vacuum to give N- 
propargyltrifluoroacetamide (13.171 g, 96%) as a colorless liquid which solidifies at -20°C. 
bp: 41°C/1.5 mm (bp ref.: 51°C/2.5 mm), mp: 31-33°C *H-NMR (CDC1 3 ): 2.34 (m, 1H, 
CH), 4.15 (dd, 2H, CH 2 ), 7.80 (br s, H, NH). iH-NMR ref. (CDCI3): 2.34 (t, 1H), 4.16 
(dd, 2H), 7.00 (br s, 1H). 13 C-NMR (CDCI3): 29.4 (CH 2 ), 72.4 (CH), 77.0 (C), 
109.859, 113.673, 117.470, 121.276 (CF 3 , J = 286.37 Hz), 156.748, 157.249, 157.750, 
158.250 (CO, J = 37.77 Hz). 

5-(3-Trifluoroacetamidopropyn-l~yl)-2 '-deoxy uridine 

The strategy to prepare 5-position substituted nucleobases involves coupling a 5- 
iodonucleobase derivative with an acetylene derivative using either a Pd(II) catalyst (e.g. 
(Ph 3 P) 2 PdCl2, M. J. Robins and P. J. Barr, J. Org. Chem. 48, 1854 (1983) for the 5*- 
DMT-protected uridine derivative (see also J. Haralambidis, M. Chai, and G. W. Tregear, 
Nucl. Acids Res. 15, 4857 (1987); K. J. Gibson and S. J. Benkovic, Nucl Acids Res. 15, 
6455 (1987))), or a Pd(0) catalyst (e.g. (Ph 3 P) 4 Pd) (M. J. Robins, R. S. Vinayak, and S. 
G. Wood, Tetrahedron Lett. 31, 3731 (1990)). The use of Pd(0) requires shorter reaction 
times and gives purer products in better yields. Anaogous recipes for preparing 
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analogouscompounds can be found in (G. T. Crisp and B. L. Flynn, /. Org. Chem. 58, 
6614 (1993).). 

S-CS-Trifluoroacetamidopropyn-l-y^-I'-deoxyuridine is mentioned in the literature (A. 
L. Casalnuovo and J. C Calabrese, J. Am. Chem. Soc. 112, 4324 (1990). F. W. Hobbs, 
Jr., /. Org. Chem. 54, 3420 (1989)), although without complete synthesis recipe or 
spectroscopic data. 5-Iodo-2'-deoxyuridine (1.4 mmol, 500 mg) was dissolved in dry 
dimethylformamide (DMF, 12 mL) and Ar was bubbled through this solution for 10 min. 
Then, (Pti3P)4Pd (0.1 equiv., 0.141 mmol, 163 mg) was added and Ar was bubbled 
through the solution for another 5 min. Triethylamine (2.0 equiv., 2.8 mmol, 285 mg, 
0.393 mL) was added via syringe followed by addition of N-propargyltofluoroacetamide 
(2.5 equiv., 3.53 mmol, 533 mg and Cul (0.2 equiv., 0.282 mmol, 53.7 mg). The mixture 
was stirred at 40°C for 5 h, the solvent was evaporated, and the residue dissolved in 
MeOH/methylene chloride 1:1 (10 mL). Ion exchange resin (Bio-Rad AG1 X8, HCO3" 
form, 1.5 g, prepared from the chloride form by eluting through a column with the 16 fold 
volume of 1 M NH4HCO3 solution followed by deionized water and finally with 0.5 M 
NH4HCO3 solution; no CI" was detected) was added to remove the Et3N*HI side product, 
and the mixture stirred at room temperature for 30 min. The mixture was passed through 
Celite, the solid washed with MeOH/methylene chloride 1:1 (10 mL) and the solvents 
removed by rotary evaporation. The residue was purified by column chromatography 
(chloroform/MeOH 8.25:1.75) to yield 5-(3-trifluoroacetamidopropyn-l-yl)-2'- 
deoxyuridine (ca. 95%) contaminated with some DMF. Rf 0.42 (chloroform/MeOH 

8.25:1.75). iH-NMR (DMSO-d 6 ): 2.13 (m, 2H, 2'), 3.60 (m, 2H, 5'), 3.81 (m, 1H, 4'), 
4.24 (m, 3H, H-9, 3'), 5.12 (t, 1H, 5'-OH), 5.27 (d, 1H, 3'-OH), 6.11 (t, 1H, 1'), 8.22 
(s, 1H, H-6), 10.09 (t, 1H, NH chain), 11.67 (s, 1H, NH cylc). 13 C-NMR (DMSO-d6): 
29.5 (C-9), 40.7 (2'), 61.0 (5'), 70.2 (3'), 75.4 (C-8), 84.8 (!'), 87.5, 87.7 (4', C-7), 
97.7 (C-5), 113.9, 117.8 (q, CF 3 , J = 287.95 Hz), 144.2 (C-6), 149.5 (C-2), 155.9, 
156.4 (q, COCF3, J = 37.25 Hz), 161.7 (C-4). 
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5-(3- Trifluoroacetamidopropyn - 1 -yl )-uridine 

This compound was prepared similarly. 5-Iodouridine (1.35 mmol, 500 mg) was 
dissolved in dry DMF (12 mL) and Ar was bubbled through the solution for 10 min. Then, 
(Ph3P)4Pd (0.1 equiv., 0.1351 mmol, 156 mg) was added and Ar was bubbled through the 
solution for another 5 min. Triethylamine (2.0 equiv., 2.703 mmol, 273 mg, 0.376 mL) 
was added via syringe followed by addition of N-propargyltrifluoroacetamide (2.5 equiv., 
3.38 mmol, 510 mg and Cul (0.2 equiv., 0.27 mmol, 5L5 mg). The mixture was stirred at 
45°C for 3.5 h, then the solvent was evaporated, and the residue dissolved in 
MeOH/methylene chloride 1:1 (10 mL). Ion exchange resin (Bio-Rad AG1 X8, HCO3- 
form, 1.5 g) was added to remove Et3N-HI, and the mixture stirred at room temperature for 
20 min. The mixture was then filtered through Celite, the solid washed with 
MeOH/methylene chloride 1: 1 (10 mL) and the solvents removed by rotary evaporation. 
The residue was purified by column chromatography (chloroform/MeOH 8.25:1.75) to yield 
5-(3-trifluoroacetamidopropyn-l-yl)-uridine (ca. 95%) as a yellow foam contaminated with 
some Et 3 N-HL R f : 0.43 (chloroform/MeOH 8.25:1.75). *H-NMR (DMSO-d6): 3.61 (m, 
2H, 5'), 3.86 (m, 1H, 4'), 3.93-4.08 (m, 2H, 2\ 3'), 4.23 (m, 2H, H-9), 5.08 (d, 1H, 
OH), 5.19 (t, 1H, OH), 5.41 (d, 1H, OH), 5.75 (d, 1H, 1'), 8.48 (s, 1H, H-6), 10.10 (t, 
1H, NH chain), 11.64 (s, 1H, NH cycl.). 13 C .NMR (DMSO-d6): 29.5 (C-9), 60.5 (5*), 
69.6 (3'), 73.8 (2'), 75.3 (C-8), 85.0 (4>), 87.6, (C-7), 88.3 (1'), 97.8 (C-5), 113.9, 
117.7 (q, CF 3 , J = 288.48 Hz), 144.4 (C-6), 149.7 (C-2), 155.9, 156.4 (q, COCF3, J = 
36.80 Hz), 161.6 (C-4). 

5-( 3-Trifluoroacetamidopropyn-l-yl)-5 y -O-dimethoxytrityl-2 1 -deoxy-uridine 
5K3-Trifluoroacetamidopropyn-l-yl)-2'-deoxyuridine (1.34 mmol, 506 mg) was 

coevaporated with pyridine, then dissolved in dry pyridine (10 mL) and cooled to 0°C. 

Et 3 N (2 equiv., 2.684 mmol, 271 mg, 0.373 mL), dimethylaminopyridine (DMAP, 0.25 
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equiv., 0.3355 mol, 41 mg) and dimethoxytrityl chloride (DMTC1, 1.2 equiv., 1.61 mmol, 
545.1 mg) were added and the mixture stirred at 0°C for 5 min. and at room temperature for 
4 h. Thin layer chromatography (tic, chloroform/ 10% MeOH, Rf = 0.47) showed that all 

starting material had been consumed. MeOH (2 mL) was added and the solvents removed 
by rotary evaporation. The residue was partitioned (EtO Ac/aqueous NaHC03 solution), the 
combined organic layers washed with water, dried (Na2SC>4) and the solvent evaporated. 
The residue was purified by column chromatography (chloroform/ 10% MeOH) to give 5- 
(3-trifluoroacetamidopropyn-l-yl)-5'-0-dimeto^ (902 mg, 99%) as 

a yellow foam. iH-NMR (CDCI3): 2.44-2.61 (m, 2H, 2'), 3.34 (m, 2H, 5'), 3.73 (s, 6H, 
MeO), 3.89 (m, 1H, 4'), 4.14 (m, 2H, H-9), 4.59 (m, 1H, 3'), 6.34 (t, 1H, 1'), 6.80 (m, 
4H, DMT), 7.14-7.33 , 7.61-7.70 (m, 9H, DMT), 8.21 (s, 1H, H-6). 13 C -NMR (CDCI3): 
30.3 (C-9), 41.6 (2'), 55.2 (MeO), 63.5 (5'), 72.0 (3'), 75.3 (C-8), 86.0 (1'), 86.9 (Cq 
trityl), 87.0, (C-7), 87.3 (4'), 99.9 (C-5), 113.3 (DMT), 113.8, 117.6 (q, CF 3 , J = 286.52 
Hz), 126.9, 127.8, 128.0, 129.9, 135.4 (all DMT), 143.6 (C-6), 144.5 (DMT), 149.4 (C- 
2), 156.4, 156.9 (q, COCF3, J = 37.70 Hz), 158.5 (DMT), 162.6 (CM). 

5-( 3-Trifluoroacetamidopropyn-l-yl)-5 '-O-dimethoxytrityluridine 
5-(3-Trifluoroacetamidopropyn-l-yl)-uridine (1.272 mmol, 500 mg) was coevaporated 
with pyridine, then dissolved in dry pyridine (10 mL) and cooled to 0°C. Et3N (2 equiv., 
2.544 mmol, 257 mg, 0.354 mL), DMAP (0.25 equiv., 0.318 mol, 38.8 mg) and DMTC1 
(1.2 equiv., 1.526 mmol, 516.7 mg) were added and the mixture stirred at 0°C for 5 min. 
and at room temperature for 5 h. Analysis by tic (chloroform/10% MeOH, Rf = 0.33) did 

not show any starting material remaining. MeOH (2 mL) was added and the mixture 
evaporated. The residue was extracted (ethyl acetate/aqueous NaHC03 solution), the 
combined organic layers washed with water, dried (Na2S04) and the solvent evaporated. 
The residue was purified by column chromatography (chloroform/ 10% MeOH) to give 286 
mg (32%) 5-(3-trifluoroacetamidopropyn- 1 -yl)-5 ' -O-dimethoxytrityluridine as a yellow 
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foam. iH-NMR (CDCI3): 3.32 - 3.41 (m, 2H, 5'), 3.70 (s, 6H, MeO), 3.80-3.88 (m, 1H, 
4'), 4.19 (m, 2H, H-9), 4.35-4.44 (m, 2H, 2\ 3'), 5.91 (m, 1H, V), 6.79 (d, 4H, DMT), 
7.19-7.71 (m, 9H, DMT), 8.18 (s, 1H, H-6). 13 C-NMR (CDCI3): 30.2 (C-9), 55.2 
(MeO), 62.7 (5'), 70.5 (3'), 74.9 (2'), 75.5 (C-8), 84.3 (4'), 86.9 (Cq trityl), 87.6, (C-7), 
90.0 (1'), 99.1 (C-5), 113.3 (DMT), 113.6, 117.4 (q, CF 3 , J = 286.90 Hz), 126.9, 127.8, 
128.0, 129.9, 135.4 (all DMT), 143.6 (C-6), 144.4 (DMT), 149.2 (C-2), 156.4, 157.0 (q, 
£QCF 3 , J = 38.23 Hz), 158.5 (DMT), 162.7 (C-4). 

3 '-0-Acetyl-5-( 3-trifluoroacetamidopropyn-l-yl)-5 '-O-dimethoxytrityl-2 '-deoxyuridine 
5-(3-Trifluoroacetamidopropyn- 1 -yl)-5' -O-dimethoxytrityl-2 '-deoxyuridine ( 1 .328 
mmol, 902 mg ) was coevaporated with pyridine and then dissolved in pyridine (10 mL) and 
DMAP (0.25 equiv., 0.332 mmol, 40.5 mg), Et 3 N (2.5 equiv., 3.32 mmol, 335 mg, 0.462 
mL) and AC2O (1.2 equiv., 1.594 mmol, 162.5 mg, 0.15 mL) were added. The mixture 
was stirred at room temperature for 2.5 h (TLC: chloroform/ 10% MeOH, Rp = 0.75). 

MeOH (5 mL) was added to stop the reaction and the solvents were removed by rotary 
evaporation. The residue was extracted (water/ethyl acetate), the organic layer dried 
(Na2S04) and the solvent evaporated. The residue was purified by column chromatography 
(chloroform/ 10% MeOH) to give 3'-0-acetyl-5-(3-trifluoroacetamidopropyn-l-yl)-5'-0- 
dimethoxytrityl-2' -deoxyuridine (937 mg, 98%) as a yellow foam. iH-NMR (CDCI3): 2.08 
(s, 3H, Ac), 2.38-2.64 (m, 2H, 2'), 3.42 (m, 2H, 5'), 3.77 (s, 6H, MeO), 3.96 (m, 1H, 
4'), 4.18 (m, 2H, H-9), 5.45 (m, 1H, 3'), 6.34 {u 1H, 1'), 6.85 (m, 4H, DMT), 7.23- 
7.48 , 7.63-7.71 (m, 9H, DMT), 8.20 (s, 1H, H-6). 13 C-NMR (CDCI3): 20.8 (Ac), 30.2 
(C-9), 38.7 (2'), 55.1 (MeO), 63.5 (5'), 74.9 (3'), 75.1 (C-8), 84.4 (1'), 85.3 (4'), 87.1 
(Cq trityl), 87.2 (C-7), 99.4 (C-5), 113.3 (DMT), 113.6, 117.4 (q, CF 3 , J = 287.42 Hz), 
126.9, 127.7, 128.0, 129.9, 135.2 (all DMT), 143.1 (C-6), 144.3 (DMT), 149.4 (C-2), 
156.3, 156.8 (q, £0_CF 3 , J = 37.78 Hz), 158.6 (DMT), 162.0 (C-4), 170.3 (Ac). 
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2 ',3 '-Di-0-acetyl-5-( 3-trifluoroacetamidopropyn-l-yl)-5 '-O-dimethoxy-trityluridine 
5<3-Trifluoroacetamidopropyn-l-yl)-5'-0-dimethoxytrityluridine (0.4115 mmol, 286 
mg ) was coevaporated with pyridine and then dissolved in pyridine (10 mL) and DMAP 
(0.5 equiv., 0.206 mmol, 25 mg), Et 3 N (5 equiv., 2.058 mmol, 208 mg, 0.886 mL) and 
Ac 2 0 (2.4 equiv., 0.988 mmol, 100.68 mg, 0.093 mL) were added. The mixture was 
stirred at room temperature for 1 h (TLC: chloroform/ 10% MeOH, R f = 0.77), then MeOH 
(5 mL) was added to stop the reaction and it was evaporated to dryness. The residue was 
extracted (water/ethyl acetate), the organic layer dried (Na 2 S0 4 ) and the solvent evaporated. 
The residue was purified by column chromatography (chloroform/ 10% MeOH) to give 
2' ,3'-di-0-acetyl-5-(3-trifluoroacetarnidopropyn- l-yl)-5'-0-dimethoxy-trityluridine (285 
mg, 89%) as a yellow foam. ^H-NMR (CDC1 3 ): 2.15 (s, 3H, Ac), 2.17 (s, 3H, Ac), 3.41- 
3.56 (m, 2H, 5'), 3.82 (s, 6H, MeO), 3.95-4.05 (m, 1H, 4'), 4.31 (d, 2H, H-9), 5.65- 
5.68, 5.76-5.81 (m+m, 2H, 2', 3'), 6.29 (d, 1H, 1'), 6.90 (d, 4H, DMT), 7.32-7.44 , 
7.72-7.79 (m, 9H, DMT), 8.18 (s, 1H, H-6). 13 C -NMR (CDCI3): 20.3, 20.4 (2 x Ac), 

30.1 (C-9), 55.1 (MeO), 62.7 (5'), 71.2 (3'), 73.0 (2'), 74.7 (C-8), 82.3 (4'), 85.6 (1'), 

87.2 (Cq trityl), 87.4 (C-7), 99.9 (C-5), 113.3 (DMT), 109.8, 113.6, 117.4, 121.2 (q, 
CF 3 , J = 287.50 Hz), 126.9, 127.8, 128.0, 129.8, 134.8, 136.5 (all DMT), 142.9 (C-6), 
143.9 (DMT), 148.8 (C-2), 155.8, 156.3, 156.8, 157.3 (q, COCF3, J = 37.25 Hz), 158.5 
(DMT), 161.9 (C-4), 169.7 (Ac). 

3 '-0-Acetyl-5-( 3-trifluoroacetamidopropyn-l-yl)-2 ' -deoxyuridine 

3'-0-Acetyl-5-(3-trifluoroacetamidoprop^ 
(1.30 mmol, 937 mg) was dissolved in dry MeOH (1 mL). A solution of anhydrous HC1 in 
anhydrous MeOH (10%, 1 mL) was added. TLC showed that the reaction was completed 
after 3 min. (TLC, chloroform/ 10% MeOH, Rp = 0.46). The mixture was cooled to 0°C 
and aqueous NaHC0 3 solution was added to adjust pH 7. The mixture was extracted (ethyl 
acetate), the organic phase separated and dried (Na 2 S0 4 ), and the solvents removed by 
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rotaiy evaporation. The residue was purified by column chromatography (chloroform/ 10% 
MeOH) to give 3'-0-acetyl-5-(3-trifluoroacetamLdopropyn-l-yl)-2'^eoxyuridine (371 mg, 
68%) as a slightly yellow foam. ! H-NMR (CD 3 OD): 2.03 (s, 3H, Ac), 2.18-2.40 (m, 2H, 
2'), 3.71 (m, 2H, 5'), 4.02 (m, 1H, 4'), 4.19 (d, 2H, H-9), 5.21 (m, 1H, 3'), 6.14 (t, 1H, 
V), 8.25 (s, 1H, H-6). 13 C -NMR (CD3OD): 20.9 (Ac), 30.7 (C-9), 39.0 (2'), 62.8 (5'), 
76.0 (3'), 76.4 (C-8), 86.9 (1'), 87.1 (C-7), 88.5 (4'), 99.9 (C-5), 111.6, 115.4, 119.2, 
123.0 (q, CF 3 , J = 285.99 Hz), 145.5 (C-6), 151.1 (C-2), 158.2, 158.7 (q, £0_CF 3 , J = 
37.78 Hz), 164.4 (C-4), 172.2 (Ac). Fab+-MS: 420.1026 ((M+l)+) 

2 ', 3 '-Di-0-Acetyl-5-(3-trifluoroacetamidopropyn-l-yl)-uridine 

2\3'-Di-0-aceryl-5-(3-trifluoroacetaimdopro 
(0.366 mmol, 285 mg) was dissolved in dry MeOH (1 mL). A solution of anhydrous HC1 
in anhydrous MeOH (10%, 1 mL) was added at 0°C and the reaction was completed after 5 
min. (TLC, chloroform/ 10% MeOH, R f = 0.32). Aqueous NaHC0 3 solution was added to 
adjust pH 7 at 0°C. The aqueous phase was extracted (ethyl acetate), the organic phase 
separated and dried (Na 2 S0 4 ), and the solvent evaporated. The residue was purified by 
column chromatography (chloroform/ 10% MeOH) to give 2\ 3'-di-0-acetyl-5-(3- 
trifluoroacetaimdopropyn-l-yl)-uridine (144 mg, 82.5%) as a slightly yellow foam. lH- 
NMR (CDCI3): 2.06 (s, 3H, Ac), 2.11 (s, 3H, Ac), 3.82-4.01 (m, 2H, 5'), 4.21 (m, 1H, 
4'), 4.28 (m, 2H, H-9), 5.44 (m, 2H, 2', 3'), 6.04 (d, 1H, 1'), 8.16 (s, 1H, H-6). 13 C - 
NMR (CDCI3): 20.2, 20.4 (2 x Ac), 30.3 (C-9),„61.1 (5'), 70.6 (3'), 73.4 (2'), 74.8 (C- 
8), 83.4 (4'), 87.6 (1'), 87.9 (C-7), 99.2 (C-5), 110.0, 113.8, 117.6, 121.4 (q, CF 3 , J = 
287.42 Hz), 144.4 (C-6), 149.5 (C-2), 156.6, 157.0, 157.5, 158.0 (q, £0_CF 3 , J = 37.25 
Hz), 162.5 (C-4), 170.1, 170.2 (2 x Ac). 

5-(3-aminopropyn-l-yl)-2 '-deoxyuridine triphosphate 
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Triphosphates were synthesized by the method of Ludwig and Eckstein (J. Ludwig and 
F. Eckstein (1989) /. Org. Chem. 54, 631). S'-O-Acetyl-S-CS-trifluoroacetamidopropyn-l- 
yl^'-deoxyuridine (0.1 mmol, 41.9 mg) was coevaporated with pyridine and dissolved in 
dry pyridine (0.1 mL)/dry 1,4-dioxane (0.3 mL). 2-Chloro-4H- 1,3,2- 
benzodioxaphosphorin-4-one (1.1 equiv., 0.11 mL 1 M solution) was added. A precipitate 
was formed and the mixture was stirred for 10 min. Then, tributylammonium 
pyrophosphate (70 mg) in dry DMF (0.35 mL)/tributylamine (0.1 mL) was added, giving a 
clear solution which was stirred for 10 min. An 1% solution of iodine in pyridine/water 
98:2 (2.2 mL) was added and the immediately decolorized solution was stirred for another 
15 min. After addition of a few drops of a 5% Na2SC>3 solution it was evaporated at room 
temperature, the residue taken up in water (5 mL) and stirred at room temperature for 30 
min. Then, concentrated aqueous ammonia (20 mL) was added and the mixture stiiTed at 
50°C for 4 h. The organic solvents were removed by rotary evaporation at room temperature 
and the residue purified by column chromatography (Sephadex DEAE cellulose, previously 
equilibrated with TEAB buffer) using a 0.2 M-0.6 M gradient of triethylammonium 
bicarbonate (TEAB) buffer solution. The UV active fractions were evaporated at room 
temperature to yield 5-(3-ammopropyn-l-yl)-2'-deoxyuridine triphosphate, contaminated 
with some diphosphate. 

Example 2. Synthesis of derivatives of 2'-deoxycytidine suitable for incorporation into 
an oligonucleotide library. 
S~Propargyl thioacetate 

A solution of triphenylphosphine (2.142 g, 8.17 mmol) in anhydrous tetrahydrofuran 
(THF, 33 mL) was treated at 0°C dropwise with diethylazodicarboxylate (DIAD, 1.608 mL, 
8.17 mmol). A white precipitate was formed. The mixtue was stirred for 30 min, and a 
solution of CH3COSH (622 mg, 0.584 mL, 8.17 mmol) and propargyl alcohol (0.317 mL, 
5.44 mmol) in 20 mL anhyrous THF was added over a period of 20 min. The mixture was 
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stirred for 2 h at 0°C (a clear solution resulted) and then for another 4 h at room temperature 
(RT). Methanol (0.1 10 mL) was then added, and the solvents removed by rotary 
evaporation (50°C, 200 mbar). The residue (ca. 8 g of a yellow liquid) was stored at -20°C 
for 12 h, during which most of the POPh3 crystallized. The liquid was removed by 
decantation, the crystals washed with petroleum ether (4 times), the liquid and washings 
were combined, and the solvents removed by rotary evaporation. The residue was 
chromatographed (silica, eluting first with petroleum ether, then with petroleum ethenether 
99.5:0.5) to yield the propargyl S-thioester of thioacetic acid (577 mg, 5.06 mmol, 93%) as 
a light yellow liquid. 

S- Acetate of5-(3-mercaptopropyn-l-yl)-2 '-deoxycytidine 

A solution of 5-iodo-2*-deoxycytidine (Sigma, 92 mg, 0.260 mmol) and Pd(PPh3)4 (30 
mg, 0.026 mmol) in absolute DMF (1.3 mL) was treated sequentially with triethylamine 
(0.072 mL, 53 mg, 0.520 mmol), the propargyl S-thioester of thioacetic acid (74 mg, 0.650 
mmol), and Cul (10 mg, 0.052 mmol). The mixture was stirred at RT (4 h), the solvents 
were removed in high vacuum, and the residue was chromatographed (silica, 
CH2Cl2:MeOH 9:1) to yield the S-acetate of 5-(3-mercaptopropyn-l-yl)-2'-deoxycytidine 
(77 mg, 0.227 mmol, 88%) as a light yellow oil. 

The mixed disulfide between tert-butylthiol and 5-(3-mercaptopropyn-l-yl)-2 
deoxycytidine 

Under argon, a solution of (tBu)OCON(S-tBu)-NHCOOtBu (L753 g, 5.470 mmol) in 
acetonitrile (18 mL) was treated with concentrated aqueous ammonia (29%). To this mixture 
was added the S-acetate of 5-(3-mercaptopropyn-l-yl)-2' -deoxycytidine (416 mg, 1.226 
mmol) in a CH3CN:H20 mixture (9:1, total volume 40 mL) After 6.5 h, the pH of the 
reaction mixture was adjusted to 5 with aqueous HC1, and the solvents were removed by 
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rotary evaporation. Chromatography of the residue yielded the mixed disulfide between tert- 
butylthiol and 5-(3-mercaptopropyn-l-yl)-2'-deoxycytidine (128 mg, 0.332 mmol, 27%). 

The mixed disulfide between tert-butylthiol and 5-(3-mercaptopropyn-l-yl)-2'- 
deoxycytidine (128 mg, 0.332 mmol) in anhydrous DMF (2.6 mL) is treated dropwise with 
DMF-dimethylacetal (0.640 mL, 3.73 mL). After incubation for 1 h, the solvents were 
removed under high vacuum to yield the mixed disulfide between tert-butylthiol and 5-(3- 
mercaptopropyn-l-yl)-2'-deoxycytidine with the 4-amino group of the nucleobase protected 
as the dimemylformamidine derivative. 

This derivative is converted to the triphosphate of the modified cytidine nucleoside via the 
procedure described above for the 2'-deoxyuridine derivative. 

Example 3. Synthesis of derivatives of 2'-deoxyadenosine suitable for incorporation into 
an oligonucleotide library. 

Solutions of 0 6 -phenyl-2'-deoxyinosine (Ferentz, A., Verdine, G. L. Nucleosides 
Nucleotides (1992) 1 1, 1749-1763) (each 1 mmol) in DMF-water (20 mL) are individually 
treated with the mxde disulfide between t-butyl thiol and cysteamine, hexylamine, 
phenemylamine, and the monotrifluoroacetamido derivative of ethylenediamine (1 M final 
concentration of each) to generate the corresponding 2'-deoxyadenosine derivatives. These 
are converted to the corresponding derivatives of 2'-deoxyadenosine carrying at the N(6) 
position the substituents -CH 2 -CH 2 -SS-C(CH 3 ) 3 , (CH 2 )5CH 3 , -CH 2 CH 2 -phenyl, and 
-CH 2 CH 2 NHCOCF 3 , respectively. These are converted to the corresponding triphosphates 
following the procedure described above. 

Example 4. Library of oligonucleotides containing functionalized nucleobases 
The oligonucleotide below containing a spacer (Seql) and a 5'-primer binding site 
(5'PBS) was prepared. 
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5 ' - ( Seql ) TTTTTCTAGAGA- ( 5 ' PBS ) -3 1 

A solution containing approximately 60 nmol of these was then treated with terminal 
transferase, where a mixture of 2 -deoxynucleoside triphosphates (10 ^iM total) following 
the procedure in Maniatis et al. (Molecular Cloning, (1982) Cold Spring Harbor Laboratory, 
p. 148). The product was a library of oligonucleotides to which a randomized sequences 
(RanSeq) was appended with an average length of ca. 200 bases. 

5 1 - ( Seql ) TTTTTCTAGAGA- ( 5 ' PBS ) - ( RanSeq) -3 * 

After the pool of mixed triphosphates is consumed, 20 equivalents of TIP was added, 
and the polymerization is continued to affix an oligo-T tail to each of the supported 

M 

oligonucleotides. 

5 ' - (Seql) TTTTTCTAGAGA- (5 1 PBS) - (RanSeq) - (T) 2o~3 ' 

The library was then examined by agarose gel electrophoresis on agarose using ethidium 
bromide staining, which revealed a range of oligonucleotides with lengths of 100 to 200 
bases. To assess the overall composition of the library, a sample of the mixture was treated 
with deoxyribonuclease, the nucleotide products dephosphorylated using alkaline 
phosphatase, and the nucleoside products resolved^nd quantitated by high performance 
liquid chromatography. 

The library is then modified by repeating this procedure, but incorporating the 
triphosphosphate of 3-P-D^2'-deoxyribofuranosyl)-(2,6-diarninopyriniidine) (5% by mole 
relative to the total of the triphosphates) and the triphosphate of 5K3-aminopropyn-l-yl)-2'- 
deoxyuridine (5% by mole relative to the total of the triphosphates). Thus, the incubation 
mixture presented to the terminal transferase contains 60 nmol of 5 ' - 
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(Seql ) TTTTTCTAGAGA- (5 ' PBS) -3 1 , 3-p-D-(2'-deoxyribofuranosyl)-(2,6- 
diaminopyrimidine) triphosphate (500 nmol), 5-(3-aminopropyn-l-yl)-2'-deoxyuridine 
triphosphate (500 nmol), ATP (3 nmol), GTP (3 nmol), and CTP (3 umol). 

Example 5. 

The library from example 4 is modified similarly, but with a nucleoside and nucleoside 
analog triphosphate pool comprised of 5-(3-mercaptopropyn-l-yl)-2'-deoxyuridine 
triphosphate (500 nmol), ATP (3 nmol), GTP (3 nmol), and TTP (3 nmol). 
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